Overview

Dataset Statistics

Number of Variables 10
Number of Rows 17414
Missing Cells 8781
Missing Cells (%) 5.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 4.1 MB
Average Row Size in Memory 245.5 B
Variable Types
  • Categorical: 5
  • Numerical: 5

Dataset Insights

season has 8781 (50.42%) missing values Missing
count is skewed Skewed
time has a high cardinality: 17414 distinct values High Cardinality
time has constant length 19 Constant Length
is_holiday has constant length 3 Constant Length
is_weekend has constant length 3 Constant Length
season has constant length 6 Constant Length
time has all distinct values Unique
temp_feels_like_c has 519 (2.98%) negatives Negatives

Variables


time

categorical

Approximate Distinct Count 17414
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.4 MB

Length

Mean 19
Standard Deviation 0
Median 19
Minimum 19
Maximum 19

Sample

1st row 2015-01-04 00:00:0...
2nd row 2015-01-04 01:00:0...
3rd row 2015-01-04 02:00:0...
4th row 2015-01-04 03:00:0...
5th row 2015-01-04 04:00:0...

Letter

Count 0
Lowercase Letter 0
Space Separator 17414
Uppercase Letter 0
Dash Punctuation 34828
Decimal Number 243796
  • time has words of constant length

count

numerical

Approximate Distinct Count 3781
Approximate Unique (%) 21.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 272.1 KB
Mean 1143.1016
Minimum 0
Maximum 7860
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • count is skewed right (γ1 = 1.3256)

Quantile Statistics

Minimum 0
5-th Percentile 53.65
Q1 257
Median 844
Q3 1671.75
95-th Percentile 3528.7
Maximum 7860
Range 7860
IQR 1414.75

Descriptive Statistics

Mean 1143.1016
Standard Deviation 1085.1081
Variance 1.1775e+06
Sum 1.9906e+07
Skewness 1.3256
Kurtosis 1.5438
Coefficient of Variation 0.9493
  • count is not normally distributed (p-value 5.233270309977525e-13)
  • count has 675 outliers

temp_real_c

numerical

Approximate Distinct Count 73
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 272.1 KB
Mean 12.4681
Minimum -1.5
Maximum 34
Zeros 34
Zeros (%) 0.2%
Negatives 40
Negatives (%) 0.2%
  • temp_real_c is skewed right (γ1 = 0.2038)

Quantile Statistics

Minimum -1.5
5-th Percentile 4
Q1 8
Median 12.5
Q3 16
95-th Percentile 22
Maximum 34
Range 35.5
IQR 8

Descriptive Statistics

Mean 12.4681
Standard Deviation 5.5718
Variance 31.0452
Sum 217119.3333
Skewness 0.2038
Kurtosis -0.2619
Coefficient of Variation 0.4469
  • temp_real_c has 64 outliers

temp_feels_like_c

numerical

Approximate Distinct Count 82
Approximate Unique (%) 0.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 272.1 KB
Mean 11.5208
Minimum -6
Maximum 34
Zeros 232
Zeros (%) 1.3%
Negatives 519
Negatives (%) 3.0%
  • temp_feels_like_c is skewed left (γ1 = -0.0583)

Quantile Statistics

Minimum -6
5-th Percentile 1
Q1 6
Median 12.5
Q3 16
95-th Percentile 21.5
Maximum 34
Range 40
IQR 10

Descriptive Statistics

Mean 11.5208
Standard Deviation 6.6151
Variance 43.7601
Sum 200623.8333
Skewness -0.05835
Kurtosis -0.6601
Coefficient of Variation 0.5742
  • temp_feels_like_c has 19 outliers

humidity_percent

numerical

Approximate Distinct Count 143
Approximate Unique (%) 0.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 272.1 KB
Mean 0.7232
Minimum 0.205
Maximum 1
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • humidity_percent is skewed left (γ1 = -0.5727)

Quantile Statistics

Minimum 0.205
5-th Percentile 0.455
Q1 0.63
Median 0.745
Q3 0.83
95-th Percentile 0.93
Maximum 1
Range 0.795
IQR 0.2

Descriptive Statistics

Mean 0.7232
Standard Deviation 0.1431
Variance 0.02049
Sum 12594.6675
Skewness -0.5727
Kurtosis -0.256
Coefficient of Variation 0.1979
  • humidity_percent is not normally distributed (p-value 0.0005124714703403229)
  • humidity_percent has 86 outliers

wind_speed_kph

numerical

Approximate Distinct Count 103
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 272.1 KB
Mean 15.9131
Minimum 0
Maximum 56.5
Zeros 68
Zeros (%) 0.4%
Negatives 0
Negatives (%) 0.0%
  • wind_speed_kph is skewed right (γ1 = 0.669)

Quantile Statistics

Minimum 0
5-th Percentile 5
Q1 10
Median 15
Q3 20.5
95-th Percentile 30.5
Maximum 56.5
Range 56.5
IQR 10.5

Descriptive Statistics

Mean 15.9131
Standard Deviation 7.8946
Variance 62.3242
Sum 277110.0833
Skewness 0.669
Kurtosis 0.4488
Coefficient of Variation 0.4961
  • wind_speed_kph has 236 outliers

weather

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.2 MB
  • The largest value (clear) is over 1.52 times larger than the second largest value (scattered clouds)

Length

Mean 9.1646
Standard Deviation 4.9202
Median 6
Minimum 4
Maximum 22

Sample

1st row broken clouds
2nd row clear
3rd row clear
4th row clear
5th row clear

Letter

Count 151980
Lowercase Letter 151980
Space Separator 7613
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (clear, scattered clouds) take over 50.0%

is_holiday

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.1 MB
  • The largest value (0.0) is over 44.35 times larger than the second largest value (1.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 0.0
2nd row 0.0
3rd row 0.0
4th row 0.0
5th row 0.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 34828
  • The top 2 categories (0.0, 1.0) take over 50.0%
  • The largest value (00) is over 44.35 times larger than the second largest value (10)
  • is_holiday has words of constant length

is_weekend

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.1 MB
  • The largest value (0.0) is over 2.5 times larger than the second largest value (1.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 1.0
2nd row 1.0
3rd row 1.0
4th row 1.0
5th row 1.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 34828
  • The top 2 categories (0.0, 1.0) take over 50.0%
  • The largest value (00) is over 2.5 times larger than the second largest value (10)
  • is_weekend has words of constant length

season

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 8781
Missing (%) 50.4%
Memory Size 598.6 KB

Length

Mean 6
Standard Deviation 0
Median 6
Minimum 6
Maximum 6

Sample

1st row winter
2nd row winter
3rd row winter
4th row winter
5th row winter

Letter

Count 51798
Lowercase Letter 51798
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • season has words of constant length

Interactions

Correlations

Missing Values